Y

YouLibs

Remove Touch Overlay

DeepMind x UCL RL Lecture Series - Policy-Gradient and Actor-Critic methods [9/13]

Duration: 01:38:50Views: 9.7KLikes: 146Date Created: Sep, 2021

Channel: DeepMind

Category: Science & Technology

Description: Research Scientist Hado van Hasselt covers policy algorithms that can learn policies directly and actor critic algorithms that combine value predictions for more efficient learning. Slides: dpmd.ai/policygradient Full video lecture series: dpmd.ai/DeepMindxUCL21

Swipe Gestures On Overlay